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MODIFIED NUCLEOTIDES AND 
m FTHDD.S OF PREPARING AND USING SAME 



BACKGROUND OF THE INVENTION 



Many procedures employed in biomedical research and recom- 
binant DMA technology rely heavily on the use of nucleotide 
or polynucleotide derivatives radioactively labeled with 
isotopes of hydrogen ( 3 H), phosphorous ( 32 P)., carbon ( 14 C) , 
or iodine ( 125 I) . Such radioactive compounds provide useful 
indicator probes that permit the user to detect, monitor, 
localize, or isolate nucleic acids- and ether molecules of 
scientific or clinical interest, even when present in only 
extremely small amounts. To date, radioactive materials/ 
have provided the most sensitive, and in many cases the 
only, means to perform many important experimental or ana- 
lytical tests. There are, however, serious limitations and 
drawbacks associated with the use of radioactive compounds./ 
First, since personnel who handle radioactive material can 
be exposed to potentially hazardous levels of radiation, 
elaborate safety precautions must be maintained during the 
preparation, utilization, and disposal of the radioisotopes. 
Secondly, radioactive nucleotides are extremely expensive to 
purchase and use, in large part due to the cost of equipment 
and manpower necessary to provide the appropriate safeguards, 
producer/user health monitoring services, and waste-disposal 
programs. Thirdly, radioactive materials are often very 
unstable and have a limited shelf-life, which further increases 
usage costs. This instability results from radiolytic decom- 
position, due to the destructive effects associated with the 
decay of the radioisotope itself, and from the fact that ✓ 
many isotopes (e.g. 32 P and ™I) have half-lives of only a 
few days. 

It is known that haptens can combine with antibodies, but 
car. initiate an immune response only if bound to a carrier. 
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This property can be exploited in detection and identifi- 
cation testing. 

It is also known that biotin and iminobiotin strongly inter- 
act with avidin, a 68,000 dalton glycoprotein from egg white. 
This interaction exhibits one of the tightest, non-covalent 
binding constants (K<j i s = 10~ 15 ) seen in nature. If avidin is 
coupled to potentially demonstrable indicator molecules, 
including fluorescent dyes, e.g. fluorescein or rhodamine; 
electron-dense reagents, e.g. ferritin, hemocyanin, or 
colloidal gold; or enzymes capable of depositing insoluble 
reaction products, e.g. peroxidase or alkaline phosphatase, 
the presence, location, or quantity of a biotin probe can be 
established. Although iminobiotin binds avidin less tightly 
than biotin, similar reactions can be used for its detection. 
Moreover, the reversibility of the iminobiot in-avidin inter- 
action, by decreasing solution pH, offers significant advan- 
tages in certain applications. 

The specificity and tenacity of the biot in-avidin complex 
has been used in recent years to develop methods for visu- 
ally localizing specific proteins, lipids, or carbohydrates 
on or within cells (reviewed by E.A. Bayer and M. Wilchek in 
Methods of Biochemical Analysis, 26, 1, 1980) . Chromosomal 
location of RNA has been determined by electron microscopy 
using a biotinized protein, cytochrome C, chemically cross- 
linked to RNA as a hybridization probe. The site of hybrid- 
ization was visualized through the binding of avidin-f er r i t in 
or avidin-me thacr ylate spheres mediated by the avidin-biot in 
interaction. (J.E. Manning, N.D. Hershey, T.R. Broker, M. 
Pellegrini, H.K. Mitchell, and NT. Davidson, Chromosoma, 53 , 
107, 1975; J.E. Manning, M. Pellegrini, and N. Davidson, 
Biochemistry, 6_1, 1364, 1977; T.R. Broker, L.M. Angerer, 
P.H. Yen, N.D. Hersey, and N. Davidson, Nucleic Acid Res., 
5, 363, 1978; A Sodja and N. Davidson, Nucleic Acid Res., j3, 
383, 1978.) This approach to the detection of polynucleo- 




tide sequences, although successful in the specialized cases 
examined which were highly reitterated sequences, is not of 
general utility for analysis of polynucleotides present in 
single or low copy number. 

Moreover, methods for attaching chemical moieties to pyrimi- 
dine and purine rings are known. Several years ago a simple 
and rapid ace toxymercur ation reaction was developed for 
introducing covalently bound mercury atoms into the 5-position 
of the pyrimidine ring, the C-8 position of the purine ring 
or the C-7 position of a 7-deazapur ine ring, both in nucleo- 
tides and polynucleotides. (R.M.K. Dale, D.C. Livingston and 
D.C. Ward, Proc. Natl. Acad. Sci. U.S.A. , T_Q' 2238, 1973 ; 
R.M.K. Dale, E. Martin, D.C. Livingston and D.C. Ward, Bio- 
chemistry, 14, 2447, 1975.) It was also shown several years 
ago that organomer cur ial compounds would react with olefinic 
compounds in the presence of palladium catalysts to form 
carbon-carbon bonds (R.F. Heck, J. Am. Chem. Soc, 90, 5518, 
1968; R.F. Heck, Ibid., 90, 5526, 1968; R.F. Heck, Ibid., 
90 , 5531, 1968; R.F. Heck, Ibid., 90, 5535, 1968; and R.F. 
Heck, J. Am. Chem. Soc. 91, 6707, 1969.) Bergstrom and 
associates (J.L. Ruth and D.E. Berstrom, J. Org. Chem., 43 , 
2870, 1978; and D.E. Bergstrom and M.K. Ogawa, J. Am. Chem. 
Soc, 100 , 8106, 1978) and Bigge, et al . (C.F. Bigge, P. 
Kalaritis, J.R. Deck and M,P. Mertes, J. Am. Chem. Soc, 
102 , 2033, 1980) have recently applied this reaction scheme 
in the synthesis of C-5 substituted pyrimidine nucleotide 
compounds. 

Finally, it is known that antibodies specific for modified 
nucleotides can be prepared and used for isolating and 
characterizing specific constituents of the modified nucleo- 
tides. (T.W. Munns and M. K. Liszewski, Progress in Nucleic 
Acid Research and Molecular Biology, 2!4, 109, 1980.) However, 
none of the antibodies prepared to date against naturally 
occurring nucleotides have been shown to react with their 



I 



- 4 - 

nucleotide determinant when it exists in a double-stranded 
RNA or DNA duplex or when in DNA-RNA hybrid molecules. 

To circumvent the limitations of r adioac t i vely labeled probes 
5 or previously utilized chemical and biological probes, a 

series of novel nucleotide derivatives that contain biotin, 
iminobiotin, lipoic acid, and other determinants attached 
covalently to the pyrimidine or purine ring have been synthe- 
sized. These nucleotide derivatives, as well as polynucleo- 

10 tides and coenzymes that contain them, will interact specif- 
ically and uniquely with proteins such as avidin or antibodies. 
The interaction between modified nucleotides and specific 
proteins can be utilized as an alternative to radioisotopes 
for the detection and localization of nucleic acid components 

15 in many of the procedures currently used in biomedical and 

recombinant-DNA technologies. Methods employing these modif- 
ied nucleot ide-protein interactions have detection capacities 
equal to or greater than procedures which utilize radioiso- 
topes and they often can be performed more rapidly and with 

20 greater resolving power. 

These new nucleotide derivatives can be prepared relatively 
inexpensively by chemical procedures which have been devel- 
oped and standarized as discussed more fully hereinafter. 

25 More significantly, since neither the nucleotide probes of 
this invention nor the protein reagents employed with them 
are radioactive, the compounds can be prepared, utilized, 
and disposed of, without the elaborate safety procedures 
required for radioisotopic protocols.. Moreover, these nucle- 

30 otide derivatives are chemically stable and can be expected 
to have functional shelf-lives of several years or more. 
Finally, these compounds permit the development of safer, 
more economical, more rapid, and more reproducible research 
and diagnostic procedures. 
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SUMMARY OF THE INVEOTION 



Compounds having the structure 



X " CH 2 O 




wherein B represents a purine, deazapurine, or pyrimidine 
moiety covalently bonded to the (^-position of the sugar 
moiety, provided that when B* is purine or 7-deazapur ine , it 
is attached at the N 9 -position of the purine or deazapurine, 
and when B is pyrimidine, it is attached at the ^-position; 

wherein A represents a moiety consisting of at least three 
carbon atoms which is capable of forming a detectable com- 
plex with a polypeptide when the compound is incorporated 
into a double-stranded ribonucleic acid, deoxyribonucleic 
acid duplex, or DNA-RNA hybrid; 

wherein the dotted line represents a chemical linkage join- 
ing B and A, provided that if B is purine the linkage is 
attached to the 8-position of the purine, if B is 7-deaza- 
purine, the linkage is attached to the 7-position of the 
deazapurine, and if B is pyrimidine, the linkage is attached 
to the 5-position of the pyrimidine; and 



wherein each of x, y r and z represents 



r 



0 0 0 

II II II 

H-, H0-, H0-P-0-, HO-P-0-P-0-, or 

1 ) I 

OH OH OH 



0 O 0 
it M II 

HO-P-O-P-O-P-0- , 

1 I I 
OH OH OH 



are 



widely useful as probes in biomedical research and recom- 
binant DNA technology. 



Particularly useful are compounds emcompassed within this 
structure which additionally have one or more of the follow- 
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ing characteristics: A is non-aromatic; A is at least C5; 
the chemical linkage joining B and A includes an a-olefinic 
bond; A is biotin or iminobiotin; and B is a pyrimidine or 
7-deazapur ine . 

These compounds may be prepared by a process which involves 
(a) reacting a compound having the structure: 

r 

B 

H H 




with a mercuric salt in a suitable solvent under 
suitable conditions so as to form a mercurated com- 
pound having the structure: 

B-Hg + 




(b) reacting said mercurated compound with a chem- 
ical moiety reactive with the -Hg + portion of said 
mercurated compound and represented by the formula 
•••N, said react ion jbeing carried out in an aqueous 
solvent and in the presence of K 2 PdCl 4 under suitable 
conditions so as to form a compound having the struc- 
ture : 



B 



x-CH 2 



N 




wherein N is a reactive terminal functional group or 
is A; and 
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(c) recovering said compound as said modified nucleo- 
tide when N is A, or when N is a reactive terminal 
group, reacting said compound with a compound having 
the structure M-A, wherein M represents a functional 
group reactive with N in an aqueous solvent under 
suitable conditions, so as to form said modified 
nucleotide, which is then recovered. 



This invention also provides compounds having the structure 

? 

HO-PH 
I 

OH 



L 




B 



it 



0-CH 




— OH 



n 



wherein each of B, B ' ', and B" represents a purine, 7-deaza- 
purine, or pyrimidine moiety covalently bonded to the ex- 
position of the sugar moiety, provided that whenever B, B 1 , 
or B" is purine or 7-deazapur ine , it is attached at the re- 
position of the purine or 7-deazapur ine , and whenever B, B 1 , 
or B" is pyrimidine, it is attached at the ^-position; 



wherein A represents a moiety consisting of at least three 
carbon atoms which is capable of forming a detectable com- 
plex with a polypeptide when the compound is incorporated 



into a double-stranded duplex formed with a complementary 
ribonucleic or deoxyribonucleic acid molecule. 

wherein the dotted line represents a chemical linkage join- 
ing B and A, provided that if B is purine the linkage is 
attached to the 8-position of the purine, if B is 7-deaza- 
purine, the linkage is attached to the 7-position of the 
deazapurine, and if B is pyrimidine, the linkage is attached 
to the 5-position of the pyrimidine; 

wherein z represents H- or H0-; and 

wherein m and n represent integers from 0 up to about 
100,000. 

These compounds can be prepared by enzymatic polymerization 
of a mixture of nucleotides which include the modified nucleo 
tides of this invention. Alternatively, nucleotides present 
in oligo- or polynucleotides may be modified using chemical 
methods . 

Nucleotides modified in accordance with the practices 
of this invention and oligo- and polynucleotides into which 
the modified nucleotides have been incorporated may be used 
as probes in biomedical research, clinical diagnosis, and 
recombinant DNA technology. These various utilities are 
based upon the ability of the molecules to form stable com- 
plexes with polypeptides which in turn can be detected, 
either by means of properties inherent in the polypeptide or 
by means of detectable moieties which are attached to, or 
which interact with, the polypeptide. 

Some uses include detecting and identifying nucleic acid- 
containing etiological agents, e.g. bacteria and viruses; 
screening bacteria for antibiotic resistance; diagnosing 
genetic disorders, e.g. thalassemia and sickle cell anemia; 
chromosomal karyotyping; and identifying tumor cells. 



DETAILED DESCRIPTION OF THE INVENTION 



Several essential criteria must be satisfied in order for a 
modified nucleotide to be generally suitable as a substitute 
for a radioactively-labeled form of a naturally occurring 
nucleotide. First, the modified compound must contain a 
substituent or probe that is unique, i.e., not normally 
found associated with nucleotides or polynucleotides. 
Second, the probe must react specifically with chemical or 
biological reagents to provide a sensitive detection system. 
Third, the analogs must be relatively efficient substrates 
for commonly studied nucleic acid enzymes, since numerous 
practical applications require that the analog be enzymatic- 
ally metabolized, e.g., the analogs must function as sub- 
strates for nucleic acid polymerases. For this purpose, 
probe moieties should not be placed on ring positions that 
sterically, or otherwise, interfere with the normal Watson - 
Crick hydrogen bonding potential of the bases. Otherwise, 
the substituents will yield compounds that are inactive as 
polymerase substrates. Substitution at ring positions that 
alter the normal "anti" nucleoside conformation also must be 
avoided since such conformational changes usually render 
nucleotide derivatives unacceptable as polymerase substrates. 
Normally, such considerations limit substitution positions 
to the 5-position of a pyrimidine and the 7-position of a 
purine or a 7-deazapur ine . 

r 

Fourth, the detection system should be capable of interact- 
ing with probe substituents incorporated into both single- 
stranded and double-stranded polynucleotides in order to be 
compatible with nucleic acid hybridization methodologies. 
To satisfy this criterion, it is preferable that the probe 
moiety be attached to the purine or pyrimidine through a 
chemical linkage or "linker arm" so that it can readily 
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interact with antibodies, other detector proteins, or chem- 
ical reagents. 

Fifth, the physical and biochemical properties of polynucleo- 
tides containing small numbers of probe substituents should 
not be significantly altered so that current procedures 
using radioactive hybridization probes need not be exten- 
sively modified. This criterion must be satisfied whether 
the probe is introduced by enzymatic or direct chemical 
means . 

Finally, the linkage that attaches the probe moiety should, 
withstand all experimental conditions to which normal nucleo- 
tides and polynucleotides are routinely subjected, e.g., 
extended hybridization times at elevated temperatures, phe- 
nol and organic solvent extraction, electrophoresis, etc. 

All of these criteria are satisfied by the modified nucleo- 
tides described herein. 

These modified nucleotides have the structure: 

B # **A 




y z 

wherein B represents a, purine, 7- deazapurine, or pyrimidine 
moiety covalently bonded to the (^'-position of the sugar 
moiety, provided that when B is purine or 7-deazapur ine , it is 
attached at the N^-position of the purine or 7-deazapur ine , 
and when B is pyrimidine, it is attached at the ^-position; 

wherein A represents a moiety consisting of at least three 
carbon atoms which is capable of forming a detectable com- 
plex with a polypeptide when the. compound is incorporated 




) 

into a double-stranded ribonucleic acid, deoxyribonucleic 
acid duplex, or DNA-RNA hybrid; 
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-P-0 


-P- 
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OH 


OH 


OH 



wherein the dotted line represents a linkage group joining B 
and A, provided that if B is purine the linkage is attached 
to the 8-position of the purine, if B is 7-deazapur ine , the 
linkage is attached to the 7-position of the deazapurine, 
and if B is pyrimidine, the linkage is attached to the 5- 
position of the pyrimidine; and 

wherein each of x, y and z represents 

0 0 0 

II (( li 

H-, H0-, H0-P-0-, H0-P-0-P-0-, or HC 

1 I I 

OH OH OH 

These compounds are widely useful as probes in biomedical 
research and recombinant DNA technology. 

Although in principal all compounds encompassed within this 
20 structural formula may be prepared and used in accordance 
with the practices of this invention, certain of the com- 
pounds are more readily prepared or used or both, and there- 
fore are presently preferred. 

25 Thus, although purines, pyrimidines and 7-deazapur ines are 
in principal useful, pyrimidines and 7-deazapur ines are 
preferred since purine substitution at the 8-position tends 
to render the nucleotides ineffective as polymerase substrates 
Thus, although modified purines are useful in certain respects, 

30 they are not as generally useful as pyrimidines and 7-deaza- 
purines. Moreover, pyrimidines and , 7-deazapur ines useful in 
this invention must not be naturally substituted at the 5- 
or 7- positions, respectively. As a result, certain bases 
such as thymine, 5-methylcytosine , and 5-hydroxyme thyl- 

35 cytosine are not useful. Presently preferred bases are 
cytosine, uracil, deazaadenine and deazaguanine . 



t 



i 




- 12 - 



A may be any moiety 'which has at least three carbon atoms 
and is capable of forming a detectable complex with a poly- 
peptide when the modified nucleotide is incorporated into a 
double-stranded duplex containing either deoxyribonucleic or 
ribonucleic acid. 

A therefore may be any ligand which possesses these prop- 
erties, including haptens which are only immunogenic when 
attached to a suitable carrier, but are capable of interact- 
ing with appropriate antibodies to produce complexes. Ex- 
amples of moe'ities which are useful include: 




O 



NH 



20 





NO 



2 



NO 



2 



25 



-C-CH 2 -CH 2 C-0--; 

o o 




and 



30 



O 



35 



Of these 
biotin and iminobiotin. 




the preferred A moieties are 




715DX 
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Moreover, since aromatic moieties tend to intercalate into a 
base-paired helical structure, it is preferred that the 
moiety A be nonaromatic. Also, since smaller moieties may 
not permit sufficient molecular interaction with polypep- 
5 tides, it is preferred that A be at least C5 so that suf- 
ficient interaction can occur to permit formation of stable 
complexes, Biotin and iminobiotin satisfy both of these cri- 
teria. 

10 The linkage or group joining moiety A to base B may include 
any of the well known bonds including carbon-carbon single 
bonds, carbon-carbon double bonds, carbon-nitrogen single 
bonds, or carbon-oxygen single bonds. However, it is gen- 
erally preferred that the chemical linkage include an ole- 

15 finic bond at the ot-position relative to B. The presence of 
such an ot-clefinic bond serves to hold the moiety A away 
from the base when the base is paired with another in the 
well known double-helix configuration. This permits inter- 
action with polypeptide to occur more readily, thereby fa- 

20 cilitating complex formation. Moreover, single bonds with 
greater rotational freedom may not always hold the moiety 
sufficiently apart from the helix to permit recognition by 
and complex formation with polypeptide. 

25 It is even more preferred that the chemical linkage group be 

derived from a primary amine, and have the structure -CH2~NH~r 
since such linkages are easily formed utilizing any of the 
well known amine modification reactions. Examples of 
preferred linkages derived from allylamine and allyl-(3- 

30 amino-2-hydroxy-l-propy 1 ) ether groups have the formulae 



-CH=CH-CH 2 -NH- and -CH=CH-CH 2 -0-CH 2 -CH-CH2-NH- , 

OH 

respectively. 




Although these linkages are preferred, others can be used, 
including particularly olefin linkage arms with other modi- 
fiable functionalities such as thiol, carboxylic acid, and 
epoxide functionalities. 

The linkage groups are attached at specific positions, 
namely, the 5-position of a pyrimidine, the 8-position of a 
purine, or the 7-position of a deazapurine. As indicated 
previously, substitution at the 8-position of a purine does 
not produce a modified nucleotide which is useful in all the 
methods discussed herein. It may be that the 7-position of 
a purine, which is occupied by a nitrogen atom, could be the 
point of linkage attachment. However, the chemical substi- 
tution methods employed to date and discussed herein are not 
suitable for this purpose. 

The letters x, y, and z represent groups attached to the 5 1 , 
3", and 2' positions of the sugar moiety. They may be any 
of 

0 0 0 0 0 0 

U U II (I It || 

H-, HO- , H0-P-0-, H0-P-0-P-0-, or HO-P-O-P — P-0-. 

\ It III 

OH OH OH OH OH OH 

Although conceivable, it is unlikely that all of x, y, and z 
will simultaneously be the. same. More likely at least one 
of x, y, and z will be a phosphate-containing group, either 
mono-, di-, or tr i-phosphate and at least one will be HO- or 
H-. As will be readily appreciated, the most likely identity 
of z will be HO- or H- indicating ribonucleotide or deoxy- 
r ibonucleotide , respectively. Examples of such nucleotides 
include 5 ' -r ibonucleoside monophosphates, 5 ' -r ibonucleoside 
diphosphates , 5 ' -r ibonucleoside tr iphosphates , 5 ' -deoxy- 
r ibonucleoside monophosphates, * 5 1 -deoxyr ibonucleoside 
diphosphates , 5 ' -deoxyr ibonucleoside tr iphosphates , 5 ' p- 
r i be nucleoside- 3 1 p, and 5 ' p-deoxyr ibonucleoside- 3 1 p . More 
specific examples include modified nucleotides of this type 
in which A is biotin or iminobiotin, the chemical linkage is 



-CH=CH-CH 2 -NH- or -CH=CH-CH 2 -C-CH 2 -<pH-CH 2 -NH- , 

OH 

and B is uracil or cytosine. 



The general synthetic approach adopted for introducing the 
linker arm and probe ir.oiety onto the base is discussed here- 
inabove. (See especially, J.L. Ruth and D.E. Bergstrom, J, 
Org. Chem., 43, 2870, 1978; D.E. Bergstrom and M. K. Ogawa, 
J. Amer. Chem. Soc. 100 , 8106, ,1978; and C.F. Bigge, P. 
Kalaritis, J.R. Deck and M. P. Mertes, J. Amer. Chem. Soc. 
102 , 2033, 1980.) However, the olefin substituents employed 
herein have not been used previously. - To facilitate attach- 
ment of probe moiety A, it has been found particularly 
desirable to employ olefins with primary amine functional 
groups, such as allylamine [AA] or allyl- (3-amino-2- / 
hydroxy-l-propyl ) ether [NAGE], which permit probe attach- 
ment by standard amine modification reactions, such as, 



II 2 

CH 2 NH 2 + R-C-OR ■ 

Imida te 



Sh 

II 

-CH 2 NHCR 



-CH 2 NH 2 + 



O 
II 

R-C 



O 

\ II 
O. >• -CH 2 NHCR 



CH 2 NH 2 + R-<J 

O 



/ 



Arihydr ide 



O 



? 

NOCR 



O 
il 

CH 2 *JHCR 



NHS-ester ( N-hydroxysuccinimide) 



S 
II 

CH 2 NH 2 + R-i\ T =C=S— > -CH 2 NHCNHR 
Isothiocyanate 



A ? H 

-CH 2 NH 2 + L — J> — R — > -CH 2 NHCH 2 CHR 

Epoxide 

Because of ease of preparation it has been found preferable 
to use NHS-esters for probe addition. However, olefin linker 
arms with other modifiable functional groups, such as thiols, 
carboxylic acids, epoxides, and the like, can also be em- 
ployed. Furthermore, both linker arm and. probe can be added 
in a single-step if deemed desirable. 



Specifically, modified nucleotides having the structure: 

_ B • • • A 

x-CH-, « 



a hS 



H 



wherein B represents a purine, 7-deazapur ine , or pyrimidine 
moiety covalently bonded to the C 1 ' -position of the sugar 
moiety, provided that when B is purine or 7-deazapur ine , it 
is attached at the N 9 -position of the purine or deazapurine, 
and when B is pyrimidine, it is attached at the Nl-position ; 

wherein A represents a moiety consisting of at least three 
carbon atoms which is capable of forming a detectable com- 
plex with a polypeptide when the compound is incorporated 
into a double-stranded ribonucleic acid, deoxyribonucleic 
acid duplex, DNA-RNA hybrid; 

wherein the dotted line represents a chemical linkage join- 
ing B and A, provided that if B is purine, the linkage is 
attached to the 8-position of the purine, if 7-deazapur ine , 
the linkage is attached to the 7-position of the deaza- 
purine, and if B is pyrimidine, the linkage is attached to 
the 5-position of the pyrimidine; and 
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wherein each of x f y, and z represents 



0 
II 

H-, H0-, HO-P-O-, 

OH 

can be prepared by: 



o 
II 



HO-P-O-P-O- 



OH 



A 



H 



O O 

or HO-P-O-P-O-P-0 

i 1 1 
OH OH 



OH 



(a) reacting a compound having the structure: 

, B 

x-CH^o 

H H 




with a mercuric salt in a suitable solvent under 
suitable conditions so as to form a mercurated com- 
pound having the structure: 

B-Hg+ 



x-CH 



H 



H 



y z 

(b) reacting said mercurated compound with a chem- 
ical moiety reactive with the -Hg + portion of said 
mercurated compound and represented by the formula 
* • * N, said reaction being carried out in an aqueous 
solvent and in the presence of K2?dCl4 under suitable 
conditions so as to form a compound having the struc- 
ture: B • • • N 

0 



x-CH 




wherein N is a reactive terminal functional group or 
is A; and 



(c) recovering said compound as said modified nucleo 
tide when N is A f or when N is a reactive terminal 
group, reacting said compound with a compound having 
the structure M-A, wherein M represents a functional 
group reactive with N in an aqueous solvent under 




suitable conditions, so as to form said modified 
nucleotide, which is then recovered. 

The following schema is illustrative: 



HN 



Relative 
concentration 




15 



20 



25 



K 2 PdCl 4 

Allylamine 

R.T. 
18-24 hr. 

Acetate buffer, pH 4-5 



dCl 



li 2 

CH 

I 

CH 2 NH 2 



Unstable 




1 

>10 



30 



Biotin- 
NHS ester 




C ^ C ^ CH 2-N-C-{CH 2 ) 4 




35 



Although the reactions can be carried out at hydrogen ion 
concentrations as low as pH 1, or as high as pH 14, it is 
preferred to operate in the range from about 4 to 8 . 
This is especially true when dealing with unstable compounds 
such as nucleoside polyphosphates, polynucleotides, and nucleo 
tide coenzymes which are hydrolyzed at pH's outside this 
range. Similarly, it is preferred to operate at a temper- 
ature in the range from about 20° C to 30° C to avoid pos- 
sible decomposition of labile organic substrates. However, 
the reactions can be carried out at temperatures from about 
5° C to 100° C. As Hs usual with chemical reactions, higher 
temperatures promote the reaction rate and lower tempera- 
tures retard it. Thus, in the temperature range from 5° C 
to 100° C, the optimum reaction time may vary from about 10 
minutes to 98 hours. In the preferred temperature range, 
reaction times normally vary from about 3 to 24 hours. 

The preferred procedure for maintaining the pH in the de- 
sired range is through the use of buffers. A variety of 
buffers can be employed. These include, for example, sodium 
or potassium acetate, sodium or potassium citrate, potassium 
citrate-phosphate, tris-acetate and bor ate-sodium hydroxide 
buffers. The concentration of buffer, when employed, can 
vary over a wide range, up to about 2.0 molar. 

While a particular advantage of the mercuration and palladium 
catalyzed addition reactions is that they can be carried out 
in water, small amounts of an organic solvent can be usefully 
included as a solubility aid. The organic solvents usually 
chosen are those which are miscible with water. These may 
be selected from ethers, alcohols, esters, ketones, amides, 

and the like such as methanol, ethanol, propanol, glycerin, 
dioxane, acetone, pyridine and dimethylf ormamide . However, 
since it has been observed that the presence of alcohols, 
such as methanol, often results in alkoxy-addit ion across 
the olefin double bond, any organic solvent used as a sol- 
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ubility aid should be chosen carefully. Introduction of 
alkoxy substituents to the a - or 6- exocyclic carbon atoms 
often results in the production of compounds which are uti- 
lized much less efficiently as enzyme substrates. 

Although various mercuric salts may be utilized, the pres- 
ently preferred salt is mercuric acetate. Also, as indi- 
cated previously, the compounds may be prepared by first 
adding a linker arm and then the moiety A, or by adding a 
linker arm to which A is already attached. Thus, the chem- 
ical moiety represented by the formula ### N may be any one 
of the numerous entities which ultimately result in pro- 
duction of the desired compounds. 

Examples include l -CH=CH-CH 2 -NH 2 , 

-CH=CH-CH 2 -0-CH 2 -CH-CH 2 -NH 2 , -CH=CH-CH 2 -NH-biot in , and 

OH 

-CH=CH 2 -CH 2 -0-CH 2 -CH-CH 2 -NH-iminobiotin. 

OH 

The amounts of the reactants employed in these reactions 
may vary widely. However, in general the amounts of unmercur- 
ated compound, mercurated compound, and palladium-containing 
compound will be substantially stoichiometric whereas the 
mercuric salt and compound * • • N will be present in molar 
excess, e.g. 5-20 moles of • • • N or of mercuric salt per 
mole of mercurated compound or unmercurated compound, respect- 
ively. In practice, amounts will vary depending upon varia- 
tions in reaction conditions and the precise identity of 
the reactants . 

Having the biotin probe directly attached to nucleotide 
derivatives that are capable of functioning as enzyme sub- 
strates offers considerable versatility, both in the exper- 
imental protocols that can be performed and in the detection 
methods (microscopic and non-microscopic) that can be 
utilized for analysis. For example, biotin nucleotides 
can be introduced into polynucleotides which are in the 
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process of being synthesized by cells or crude cell ex- 
tracts, thus making it possible to detect and/or isolate 
nascent (growing) polynucleotide chains. Such a procedure 
is impossible to do by any direct chemical modification 
method. Furthermore, enzymes can be used as reagents for 
introducing probes such as biotin into highly selective or 
site-specific locations in polynucleotides; the chemical 
synthesis of similar probe-modified products would be ex- 
tremely difficult to achieve at best. 

The synthesis of nucleotides containing biotin or imino- 
biotin was achieved as detailed in the examples set forth 
hereinafter. Pyrimidine nucleoside triphosphates con- 
taining either of these probes attached to the C-5 carbon 
atom were good to excellent substrates for a wide variety of 
purified nucleic acid polymerases of both prokaryotic and 
eukaryotic origin. These include DNA polymerase I of E. 
ccxli, bacteriophage T4 DNA polymerase, DNA polymerases a and $ 
from murine (A-9) and human (HeLa) cells, and the DNA poly- 
merase of Herpes simplex virus. Confirming data were obtain- 
ed with E. col i DNA polymerase I using either the nick- 
translation condition of Rigby, et al . (P.W.J. Rigby, M. 
Dieckmann, C. Rhodes and P. Berg, J. Mol . Biol. 113 , 237, 
1977) or the gap-filling reaction described by Bourguignon 
et al. (G.J. Bourguignon, P.J. Tattersall and D.C. Ward, J. 
Virol. 20, 290, 1976) . Bio-dUTP has also been found to 
function as a polymerase substrate both in CHO cells, permea- 
bilized by treatment with lysolecithin according to the 
method of Miller, et al . (M.R. Miller, J.C. Castellot, Jr. 
and A.B. Pardee, Exp. Cell Res. 120 , 421, 1979) and in a nu- 
clear replication system prepared from Herpes simplex in- 
fected BHK cells. Although biotinyl ribonucleoside tri- 
phosphates were found to function as substrates for the RNA 
polymerases of E. coli and bacteriophage T7 , they are not 
utilized as efficiently as their deoxyr ibonucleot ide tri- 
phosphate counterparts. Indeed, they are incorporated 




poorly, if at all., by the eukaryotic RNA polymerases ex- 
amined (HeLa cell RNA polymerase III, calf thymus RNA poly- 
merase II and mouse cell RNA polymerase II), While this 
limited range of substrate function does restrict the util- 
ity in some _in vivo or in vitro transcription studies, biotin 
labeled RNA probes can be prepared enigmatically from DNA 
templates using E. coli or T7 RNA polymerases or by 3 ' end- 
labeling methods using RNA ligase with compounds such as 
biotinyl-pCp. The AA- and NAGE-der ivat ives of UTP are, how- 
ever, substrates for the eukaryotic RNA polymerases men- 
tioned above. With the availability of antibodies to these 
analogs, the isolation of nascent transcripts by immuno- 
logical or affinity procedures should be feasible. 

The enzymatic polymerization of nucleotides containing bio- 
tin or iminobiotin subst ituetnts was not monitored directly, 
since neither of these probes were radiolabeled. However, 
two lines of experimental evidence clearly show that the 
biotinyl-nucleotides were incorporated. The first is that 
polynucleotides synthesized in the presence of biotin- 
nucleotides are selectively retained when chromatogr aphed 
over avidin or streptavidin affinity columns. (Tables I and 
II) For example, whereas normal DNA, nick translated with 
32 P-dAMP, is quantitatively eluted upon the addition of 0.5 
M NaCl, the vast majority of biotinyl-DNA or iminobiot inyl- 
DNA remains bound to the resin even. after extensive washing 
with high salt, urea, quanidine-HCl , formamide or 50 mM 
NaOH. The small fraction of the radiolabel eluted by these 
washing conditions is not retained when applied to the resin 
a second time, suggesting that radioactivity is associated 
with DNA fragments which are free of biotin substitution. 
The second line of evidence is that only biot in-labeled 
polynucleotides are immunoprecipi tated when treated with 
purified anti-biotin IgG followed by formalin-fixed Staphy - 
lococcus aureus . (Table III) It is clear from the data in 
these tables that extremely small amounts of biotin can be 
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detected by this method. These results also show that the 
biotin molecule can be recognized by avidin, streptavidin or 
specific antibodies while the DNA is still in its native, 
double-stranded form, a condition that is absolutely essen- 
tial if the antibody-binding or avidin-af f ini ty approaches 
are to be useful in probe detection employing hybridiza- 
tion techniques. 

TABLE ,1 



SELECTIVE RETENTION OF BIOTINIZED DNA 

ON AVIDIN-SEPHAROSE 



15 



20 



Eluent 



Load - 



% DNA Retained on Resin 



(1) 
(2) 

< 3 > 
(4) 

(6) 
25 (7) 



3 x 10 5 cpm 
10 mM Tris 7.5 
+ 0 . 2 M NaCl 

0.5 M NaCl 
1.0 M NaCl 
8 M Urea 

6 M guanidine-HCl 
99% formamide 
2 mM Biotin 
50 mM NaOH 



Bio-DNA (1%) 



100 



100 
99 

100 
95 
94 
97 
89 



2 
7 
6 
5 



T-DNA 



100% 



0.1 

<0.01 
<0. 01 
<0. 01 
<0. 01 
<0.01 
<0.01 



# 
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TABLE II 

Affinity Chromatography of Iminobiot in-dUTP 
and Iminobiotinized - DNA on Streptavidin-Sepharose 



15 



Eluent 



Load - 



10 (1) 

(2) 

(3) 

(4) 
(5) 



(6) 



10 mM Tris-HCl, 8. 
50 mM NaCl 
0.1 M NaCl 
1.0 M NaCl 
8 M Urea 

6 M guanidine-HCl 
50 mM NH 4 -acetate, 

pH 4.0 
50 mM NH4~acetate, 

pH 4.0 
2 mM biotin 



% Retained on SA-Sepharose 

T-DNA 3 H-IB-dUTP IB-DNA 



8.7 
<0. 1 
<0.01 
<0. 01 
<0. 01 

< 0 . 01 
<0.01 



100 

100 

100 
97.5 
97.0 

<0. 01 
<0. 01 



99 
99 
99 
98 



7 
7 
4 
5 



97. 0 



96. 5 



<0. 01 



20 



TA B LE III 

SELECTIVE IMMUNOPRECIPITATION OF BIO-DNA 
WITH ANTI-BIOTIN IgG and STAPH AUREUS 



25 



DNA 



Antibody 



CPM in 
Immuno ppt 



CPM in 
Supernatant 



T-DNA 
T-DNA 
30 T-DNA 



Anti-Bio IgG 
Non- immune IgG 



70 
87 
55 



4867 
5197 
5107 



Bio-DNA 
Bio-DNA 
Bio-DNA 



35 



Anti-Bio IgG 
Non- immune IgG 



53 
3347 
60 



3886 
736 
3900 



*N.T. pBR-322 DNA, 32 p _ labeled; 1% Biotin substitution. 
Specific activity, 2 x 10 7 cpra/yg 
Biotin detection 0.001-0.01 pmoles. 
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Thus, it is possible to prepare novel compounds having the 
structure: 




OH 



n 



wherein each of B, B 1 , and B" represents a purine, deaza>- 
purine, or pyrimidine moiety covalently bonded to the ex- 
position of the sugar moiety, provided that whenever B, B", 
or B" is purine or 7-deazapur ine , it is attached at the im- 
position of the purine or deazapur ine , and whenever B, B', 
or B" is pyrimidine, it is .attached at the ^-position; 

wherein A represents a moiety consisting of at least three 
carbon atoms which is capable of forming a detectable com- 
plex with a polypeptide when the compound is incorporated 
into a double-stranded duplex formed with a complementary 
ribonucleic or deoxyribonucleic acid molecule. 



wherein the dotted line represents a linkage group join- 
ing B and A, provided that if B is purine* the linkaqe is 
attached to the 8-position of the purine, if B is 7-deaza- 
purine, the linkage is attached to the 7-position of the 



5 



10 



* 
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deazapurine, and if B is pyrimidine, the linkage is attached 
to the 5-position of the pyrimidine; 

wherein z represents H- or HO- ; and 

wherein m and n represent integers from 0 up to about 
100 , 000 . 

Of course, it should be readily understood that in general m 
and n will not simultaneously be 0 since, in that event, the 
compound becomes merely a modified nucleotide as described 
previously. In general B 1 and B" will vary within the same 
oligo- or polynucleotide, being alternatively uracil, cyto- 
sine, thymine, guanine, adenine, or the like. Also, in gen- 
15 eral, the variation will correspond to the ordered sequence 
of nucleotides which codes for the synthesis of peptides 
according to the well known Genetic Code. However, it is 
intended that the structure shown also embrace polynucleo- 
tides such as poly C, poly U, poly r (A-U) , and poly d(A-U) 
20 as well as calf thymus DNA, ribosomal RNA of E. coli or 

yeast, bacteriophage RNA and DNA (R17, fd) , animal viruses 
(SV40 DNA), chromosomal DNA, and the like, provided only 
that the polynucleotides be modified in accordance with this 
invention. 

25 

It is also to be understood that the structure embraces more 
than one modified nucleotide present in the oligomer or 
polymer, for example, from two to thirty modified nucleo- 
tides. The critical factor in this regard is that the number 
30 of modifications not be so great that the polynucleotide is 
rendered ineffective for the intended use. 

Finally, it should be understood that modified oligo- and 
polynucleotides can be joined to form larger entities having 
35 the same structure so long as terminal groups are rendered 
compatible or reactive. 
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These compounds can be made by enzymatic polymerization of 
appropriate nucleotides, especially nucleotide triphosphates 
in the presence of a nucleic acid template which directs 
synthesis under suitable conditions. Such conditions can 
5 vary widely depending upon the enzyme employed, amounts of 
nucleotides present, and other variables. Illustrative 
enzymes include DNA polymerase I of E. coli , bacteriophage 
T4 DNA polymerase, DNA polymerases ct and 8 from murine and 
human (HeLa) cells, DNA polymerase from Herpes simplex 
10 virus, RNA polymerase of E. coli , RNA polymerase of bacter- 
iophage T7, eukaryotic RNA polymerase including HeLa cell 
RNA polymerase III., calf thymus RNA polymerase II, and mouse 
cell RNA polymerase II. 

15 Also, the compounds can be prepared by terminal addition to 
oligo- or polynucleotides to produce compounds in which m or 
n is 0 depending upon whether the addition is at the 5 1 or 
3' position. Moreover, the compounds such as pCp or pUp in 
which the base is biotinized can be added to existing mole- 

20 cules employing the enzyme RNA ligase. 

Modified oligo- and polynucleotides can also be prepared by 
chemical modification of existing oligo- or polynucleotides 
using the approach described previously for modification of 
25 individual nucleotides. 

The various modified nucleotides, oligonucleotides, and 
polynucleotides of this invention may be detected by con- 
tacting the compounds with polypeptides which are capable of 
30 forming complexes therewith under suitable conditions so as 
to form the complexes, provided that the polypeptides in- 
clude one or more moieties which can be detected when the 
complex or complexes is or are formed, generally by means of 
conventional detection techniques. 

35 

One polypeptide detector for the biotiny 1- type probe is 
avidin. The avidin-biotin interaction exhibits one of the 



tightest non-covalent binding constants (K dis =10= 15 ) seen ir 
nature. If avidin is coupled to potentially demonstrable 
indicator molecules, e.g., fluorescent dyes (fluorescein, 
rhodamine) electron-dense reagents (ferritin, hemocyanin, 
colloidal gold) , or enzymes capable of depositing insoluble 
reaction products (peroxidase, alkaline phosphatase) the 
presence, location and/or quantity of the biotin probe can 
be established. 

Avidin has, unfortunately, one property that makes it less 
desirable as a biot in-indica tor protein when used in con- 
junction with nucleic acids or chromatin material. It has 
been reported (M.H. Heggeness, Stain Technol . , 52, 165, 
1977; M.H. Heggeness and J.F. Ash, J. Cell. Biol., 73, 783, 
1977; E.A. Bayer and M. Wilchek, Methods of Biochemical 
Analysis 26, 1, 1980) that avidin binds tightly to condensed 
chromatin or to subcellular fractions that contain large 
amounts of nucleic acid in a manner which is independent of 
its biotin-binding property. Since avidin is a basic glyco- 
protein with a pi of 10.5, its histone-like character or its 
carbohydrate moieties are most likely responsible for these 
observed non-specific interactions. 

A preferred probe for biotin-containing nucleotides and 
derivatives is streptavidin> an avidin-like protein syn- 
thesized by the soil organism Streptomyces avidinii . its 
preparation and purification is described in Hoffman, et 
al., Proc. Natl. Acad. Sci., 77, 4666 (1980). Streptavidin 
has a much lower pi (5.0), is non-glycosyla ted , and shows 
much lower non-specific binding to DNA than avidin, and 
therefore offers potential advantages in applications in- 
volving nucleic acid detection methodology. 

A most preferred protein for biotin-like probe detection is 
monospecific rabbit IgG, antibiotin immunoglobulin. This 
compound was prepared by immunizing rabbits with bovine 
serum albumin conjugated biotin as described previously (M. 
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Berger, Methods in Enzymology, 62, 319 [1979]) and purified 
by affinity chromatography. Although the association con- 
stant of immunoglobulin-haptens have values of K (10 6 to 
_ _ assn 

10 xu ) which are considerably lower than for avidin-biotin 
complexes, they are substantially equivalent to those ob- 
served with the avidin-iminobiot in complex. Furthermore, 
the anti-biotin antibodies have proven extremely useful in 
detecting specific polynucleotide sequences on chromosomes 
by in situ hybridization since little, if any, non-specific 
binding of the antibody to chromatin material occurs. 

The modified polynucleotides of this invention are capable 
of denaturation and renaturation under conditions compatible 
with their use as hybridization probes. An analysis of the 
thermal denaturation profiles and hybridization properties 
of several biot in-substituted DNA and RNA polymers clearly 
indicates this. For example, pBR 322 DNA or X DNA, nick 
translated to introduce approximately 10-100 biotin residues 
kilobase, have Tm values essentially identical to that of 
the control, biotin-free DNAs. Furthermore, 32p_i a b e i e( 3 9 
biotin-subst ituted , pBR 322 DNA/ exhibited the same degree of 
specificity and autoradiographic signal intensity as con- 
trol, thymidine-containing DNA y when used as a hybridization 
probe for detecting bacterial colonies containing the plas- 
mid . 

In DNA duplexes, such as MVM RF DNA, in which every thymidine 
residue in one strand (1250 in toto per 5 Kb) is replaced 
by a biotinyl-nucleot ide , the Tm is only 5° C less than 
that of the unsubs ti tuted control. Although the Tm of 
poly d (A-bioU) in which each base pair contains a bio-dUMP 
residue is 15° C lower than the poly d(A-T) control, the 
degree of cooperat ivity and the extent of hyperchromici ty 
observed both during denaturation and renaturation were 
the same for the two polymers. A parallel analysis of 



RNA duplexes and DNA/RNA hybrids indicates that their Tin's 
also decrease as the biot in-content of the polymer increases 
However, it is clear that a substantial number of biotin- 
molecules can be introduced without significantly altering 
the hybridization characteristics of the polymers. 

These results strongly suggested that biotin-subst i tuted 
polynucleotides could be used as probes for detecting and/or 
localizing specific polynucleotide sequences in chromosomes, 
fixed cells, or tissue sections. The general protocol 
for detecting the biot in-substituted probe is schematically 
illustrated as follows: 

GENERAL PROTOCOL FOR PROBE DETECTION 
VIA IN 5Y7Y/, COLON YjOR NORTHERN /SOUTHERN 

HYBRIDIZAYiON METHODS 



Anti probe sequence 

BnDBHflKBR 



I) Target 

Delivery 



Hybridize with biotinized or 
hoptenized probe (with or with- 
in out cloning vechicle sequences) 




6 6 6 



2) Signal 

Ampli f icat ion 




— O = Biottn or 
HQpte ne 



1) Avidin - peroxidase 

2) Ig G - peroxidase 

v 3) Primary cf-deter minent Ig G 



-a 




6 .6. A. .6. 6 





3) Detection : I), Insoluble peroxidase products : DAB 

2) Antibody sondwiching techniques 



This general scheme illustrates only procedures used for 
gene mapping (cytogenetics), and recombinant DNA-technol- 
ogies. However, it can be equally well applied to the de- 
tection of nucleic acid sequences of bacterial, viral, 
5 fungal or parasite origin in clinical samples and this forms 
the basis of a powerful new approach to clinical diagnostics 
which does not rely on the use of radioisotopes. 

Immunological and histochemical methods for the detection of 
10 biotin have shown that the basic approach is useable for a 
rapid method of gene mapping in situ hybridization and non- 
radioactive procedures for detecting specific nucleic acid 
sequences by blotting hybridization methods. Use may be 
made of this technology in development of new clinical diag- 
15 nostic procedures. \ 

Using this approach / it is possible to determine the presence 
of a specific deoxyribonucleic or ribonucleic acid molecule, 
particularly such a molecule derived from a living organism, 
20 e.g. bacteria, fungus, virus, yeast, or mammal. This in 

turn permits diagnosis of nucleic acid-containing etiolog- 
ical agents in a patient or other subject. 

Moreover, it provides a -method for screening bacteria to 
25 determine antibiotic resistance. Thus, for example, peni- 
cillin resistance in Streptococcus pyogenes or Neisser is 
meningitidis ; tetracycline resistance in Staphylococcus 
aureus , Candida albicans , Pseudomonas aer ug inosa , Strep - 
tococcus . pyogenes , or Neisser ia gonorrhoeae ; and amino- 
30 glycoside resistance in Mycobacter ium tuberculosis can be 
determined. 

In these methods a polynucleotide is prepared which is 
complementary to the nucleic acid sequence which charact- 
35 erizes the organism or its antibiotic resistance and which 





additionally includes one or more modified nucleotides 
according to this invention. This polynucleotide is 
hybridized with nucleic acid obtained from the organism 
under scruntiny. Failure to hybridize indicates absence 
of the organism or of the resistance characteristic. 
Hybridized nucleic acid duplexes are then identified by 
forming a complex between the duplex and a suitable 
polypeptide which carries a detectable moiety, and detect- 
ing the presence of the complex using an appropriate de- 
tection technique. Positive detection indicates that the 
complex, the duplex and therefore the nucleic acid se- 
sequence of interest are present. 

This approach can be extended to the diagnosis of genetic 
disorders, such as thalassemia and sickle cell anemia. 
The deoxyr ibonucleot ide acid gene sequence whose presence 
or absence (in the case of thalassemia) is associated with 
the disorder can be detected following hybridization with 
a polynucleotide probe according to this invention based 
upon complex formation with a suitable detectable poly- 
peptide. 

The mapping of genes or ,their transcripts to specific 
loci on chromosomes has been a tedious and time-con- 
suming occupation, involving mainly techniques of cell- 
fusion and somatic cell genetics. Although in situ 
hybridization has been employed successfully for mapping 
single-copy gene sequences in species that- undergo 
chromosomes polytenization, such as Drosophila , de- 
tection of unique sequence genes in most higher eukaryotic 
chromosomes has been extremely difficult, if not impossible, 
using standard hybrization methods. The necessity for 
polynucleotide probes of very high specific radioactivity 
to facilitate autoradiographic localization of the hybridi- 
zation site also results in rapid radiodecomposition of the 
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probe and a concomitant increase in the background noise of 
silver grain deposition. The use of hybridization probes 
with low to moderate specific radioactivities requires ex- 
posure times of many days or weeks, even to detect multi- 
copy sequences, such as ribosomal RNA genes or satellite 
DNA. Since recombinant DNA technology has made feasible the 
molecular cloning of virtually every single-copy sequence 
found in eukaryotic cells, it would be extremely beneficial 
to have a rapid and sensitive method for mapping the chro- 
mosomal origin of such cloned genomic fragments. 

Modified nucleotides may be used in a method of gene 
mapping by in situ hybridization which circumvents the use 
of radioisotopes. This procedure takes advantage of a 
thymidine analogue containing biotin that can be incor- 
porated enzymatically into DNA probes by nick translation. 
After hybridization in situ the biotin molecules serve as 
antigens for affinity purified rabbit anti-biotin anti- 
bodies. Immunof luorescent antibody sandwiches made with 
f luorescein-labeled goat anti-rabbit IgG allow for rapid and 
specific cytogenetic localization of cloned gene sequences 
as green-yellow bands. This method offers four major ad- 
vantages over conventional autoradiographic methods of in 
situ gene localization; .less background noise, an increase 
in resolving power between bands; a decrease in the time 
required to determine the site of probe hybridization; 
and chemically stable hybridization probes. This 
method has been applied successfully to the localization of 
reitterated and unique DNA sequences in the polytene chro- 
mosome of Drosophila milanogas ter and satellite DNA on 
mouse metaphase chromosomes. 

Thus it has been found that polytene chromosomes could be 
used as a test system for establishing the efficacy of 
probes using the modified nucleotides according to the 
instant invention as detected by indirect immunofluor- 
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escence for in situ gene mapping. The probes included a 
variety of cloned Drosophila sequences obtained from Otto 
Schmidt and Dieter Soli, such as tRNA genes cloned in 
plasrnid vectors with inserts of sizes ranging from about 5 
to about 22 kilobases. Many of these clones have already 
been assigned to specific bands on the Drosophila chromosome 
map by conventional in situ hybridization methods employing 
radioisotopes. •■ 

DNA probes were nick translated in the presence of Bio-dUTP. 
Occasionally 3 H dATP and/or 3 H dCTP was included in the nick 
translation reaction mixture. This allowed both autoradio- 
graphic and immunof luorescent localization of a se- 
quence on a single chromosone spread. in situ hybridi- 
zation was performed as described in M.L. Pardue, and J.G. 
Gall, Methods in Cell Biol., 10, 1 (1975)'. After the final 
2 x SSC wash to remove unhybridized probe, the slides were 
rinsed with PBS (phosphate buffered saline) and incubated at 
37° C with 2.5 p g/ml Rabbit anti-biotin in PBS and 10 mg/ml 
BSA for 2-16 hours. This was followed by incubation of the 
slides with FITC labeled Goat anti-Rabbit igG (Miles Labora- 
tories, diluted 1:100 in PBS and 10 mg/ml BSA) for one-four 
hours. Evans Blue was often required as a red counterstain 
to see the chromosomes with fluorescent illumination. 

When plasmids pBR 17D and pPW 539 containing 5 Kb and 22 Kb 
inserts, respectively, were hybridized by this method, it 
was found that, the pattern of hybridization is reproducible 
from spread" to spread and is observed unambiguously on 
greater than 90% of the chromosome spreads on a given slide. 

The cloned transposable element pAC 104 is known to map at 
many sites along the Drosophila genome. Comparison of the 
autoradiograph and the fluorescent picture obtained by in 
situ hybridization of this probe illustrates a major ad- 
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vantage of this method, i.e., that where diffuse regions of 
silver grains appear on an autor adiograph, doublets or a 
series of bands are discernible by immunof luor escent label- 
ing. 

The other immediately obvious advantage of this method is 
the tremendous decrease in time required for gene assign- 
ments to be made by indirect immunofluorescence. An as- 
signment of a DNA fragment to a specific band can be made 
within six hours of hybridization. This is in comparison to 
days or weeks required for autoradiographic exposure 
methods. This factor, in combination with increased resolu- 
tion, makes the use of modified nucleoti^ps detected by in- 
direct immunofluorescence immediately preferable to more 
classical methods. 

It has been shown that this immunological method also works 
with mammalian chromosomes wherein satellite DNA has been 
mapped to the centromeric regions of mouse metaphase chromo- 
somes. \ The result provides a basic foundation for the 
development of a simple gene mapping procedure for single 
copy (unique) sequences in chromosomes from human and other 
mammals. Such a procedure should greatly facilitate our 
understanding of the genetic organization of the chromosome 
and make clinical cytogenetic diagnosis much more rapid and 
practical . 

While a single-step "antibody sandwich" method in which the 
chromosome spread is challenged, post-hybridization, with 
rabbit anti-biotin IgG may succeed, this protocol may not 
generate sufficient fluorescence for unambiguous gene as- 
signments. However, a much stronger fluorometric signal can 
be achieved by using the "haptene-ant ibody sandwich tech- 
nique" described by Lamm, et al , , (1972); Wofsy, et al., 
(1974). In this procedure the primary antibody, in our case 
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monospecific, rabbit anti-biotin IgG, is chemically modified 
with a haptenization reagent, such as 2, 4-dinitrofluoro- 
benzene, preferably while the immunoglobulin is bound to an 
antigen affinity column (biot in-Sepharose TM) . As many as 
15-20 haptene (DNP) groups can be coupled to the primary 
antibody without decreasing its antigen binding affinity or 
specificity (Wallace and Wofsy, 1979). If the primary 
antibody treatment of the test' sample is followed by an 
incubation with a f luorescently labeled anti-hapten IgG 
antibody, rather than a f luorescently labeled anti-lgG, a 5- 
7 fold increase in fluorescence signal can be achieved. 
Since one also has available monospecific guinea pig anti- 
DNP IgG, we can haptenize this secondary antibody with bio- 
tin and thus generate two anti-hapten IgG populations, DNP- 
labeled anti-biotin IgG and biot in-labeled anti-DNP IgG. If 
these can be used alternately to achieve several rounds of 
hapten -antibody sandwiching and then followed with fluor- 
escently labeled protein A from Staphylococcus aureus , which 
binds specifically to IgG molecules from many mammalian 
species, it could result in an enormous amplification of the 
primary antibody signal with its concomitant utility. 



The protein streptavidin from Streptomyces avidini is a 
potential alternative to anti-biotin IgG as a vehicle to 
specifically direct a coupled visualization system [e.g., 
fluorescent probes (above) or histochemical reagents (below)] 
to the site of the hybridized biotin-containing polynucleo- 
tide. One of streptavidin ' s advantages over anti-biotin 
IgG is that its affinity for biotin is K assn = 10 15 whereas 
association constants for haptene-IgG interactions are 
10 7 to 10 10 . The fast reaction rate and extreme affinity 
mean that the time required to localize the biotinized 
probe will be minutes with streptavidin versus hours with 
35 immunologic reagents. 
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Initial evaluations of a streptavidin detection system are 
currently in progress. Polytene chromosomes hybridized with 
biot inized DNA probes will be incubated with streptavidin 
followed by a subsequent incubation with bovine serum albumin 
which has been doubly labeled with biotin and FITC (FITC, 
biotinyl-BSA) . Since only one of the four streptavidin 
subunits is likely to be involved in binding at each biotin- 
ized DNA site, potentially one labeled BSA molecule can bind 
to each of the remaining three noncon j ugated subunits of the 
str eptavidin-biot inyl nucleotide complex. The fluorescence 
signal from this single streptavidin + FITC/ biotinylBSA 
layer will be compared with a control using the basic 
"antibody sandwich method" described earlier. 

If the "antibody sandwich" and streptavidin + FITC , biotinyl- 
BSA detection intensities are comparable, one can attempt to 
enhance the streptavidin + FITC, biotinyl-BSA system to 
single-copy copy sensitivity in a manner that parallels the 
multiple "haptene-antibody sandwich" approach. Since some 
of biotin groups on BSA will not be bound to the first layer 
of streptavidin, a second layer of streptavidin can be added 
until sufficient signal is obtained. For example, if in the 
second layer, only two streptavidin protomers bind to each 
first-layer BSA and each of these streptavidin protomers 
binds three FITC-biot inyl BSA molecules, then the second 
layer intensity will be twice as great as that from the 
first layer; for the third layer, with analogous binding 
stoichiometr ies , the fluorescent intensity will be 12-fold 
that of the first layer, so the total intensity will rapidly 
increase with successively added layers. 

There are plans to use a larger carrier protein such as 
thyroglobul in rather than BSA in order to maxirhize amounts 
of attached fluorescent and biotin probes. It may also be 
necessary to use a longer linker arm between the biotin 
probe and the carrier protein. A longer linker arm should 
sterically optimize the theoretical delivery of a biotinized 
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fluorescent carrier molecule to each noncon j uga ted streptavidin 
subunit and maximize the number of streptavidin protomers 
in the subsequent layer which will bind to the biotinized 
fluorescent carrier. As before, appropriate controls will 
be done to insure that substitution of the carrier protein ^ 
with fluorescent probes and biotin does not cause solubility 
and/or nonspecific binding problems. 

The streptavidin-carrier protein delivery system has two 
significant advantages over the immunf luorescent approach 
in addition to its speed of delivery. First, only two 
protein components are needed to form the layers. Second, 
only the carrier protein needs to be modified and it is 
not necessary to maintain functional or even total structural 
integrity as long as the biotin groups are accessible to 
streptavidin. 



An alternative to the fluorescence method for visualizing 
hybridized probes is to direct enzymes such as peroxidase, 
alkaline phosphatase of B-galactosidase to the hybridization 
site where enzymatic conversion of soluble substrates to 
insoluble colored precipitates permits light microscope 
visualization. The important advantage of this technique 
is that the his tochemical methods are 10 to 100-fold more 
sensitive than fluorescence- detection. In addition, the 
colored precipitates do not bleach with extensive light 
exposure thus avoiding one of the general disadvantages 
of fluorescent light microscopy. These enzymes can be 
coupled to the final antibody instead of fluorescent probes 
in the "haptene-ant ibody sandwich" technique using bifunctional 
reagents such as glu tar aldehyde or in the case of peroxidase 
via oxidation of the peroxidase carbohydrate moieties to 
aldehydes and coupling of these residues with e-amino groups 
of the desired protein. For the str eptavidin-biot inized 
carrier protein method, an enzyme with biotinyl groups 
coupled to it could replace a f luor escently-biotinized 
carrier system. Alternately, the enzyme could be coupled 
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via biotin to the last layer of streptavidin with amplification 
of streptavidin sites being built up in preceding layers 
using biotinized BSA or thyroglobul in . We will begin develop- 
ing the necessary histochemical reagents and the appropriate 
substrate/insoluble product combinations for visualizing 
in situ hybridizations without background problems in the 
near future. The histochemical approaches to signal amplifica- 
tion should therefore be ready for trial in the summer 
of 1981. 



Detecting and/or imaging very low levels of fluorescent 
light is possible using currently available image intensifiers 
or systems composed of lasers and photomul tipl ier s . These 
methods permit the detection of light down to the level 
of individual photons. With suitable digital processing 
systems, images can be produced in which each point, i.e. 
each pixel, of the image is strictly proportional to the 
number of photons emitted by a point at the object. Using 
systems of this kind or flow systems in which the cells 
or parts of cells flow past a laser beam, one can obtain 
detection sensitivity increases for fluorescent material 
of factors between 100 and 1000 beyond that which can be 
detected by the eye. This increase is sufficient to detect 
the fluorescence of single copy genes. 

In a preferred modification, analogs of dUTP and UTP that 
contain a biotin molecule covalently bound to the C-5 posi- 
tion of the pyrimidine r ; ing through an allylamine linker arm 
have been synthesized. These biotinyl-nucleot ides are ef- 
ficient substrates for a variety of DNA and RNA polymerases 
in vitro. DNA containing low levels of biotin substitution 
(50 molecules or less/kilobase ) has denaturation , reassoc- 
iation and hybridization characteristics which are indis- 
tinguishable from that of unsubstituted control DNA. 

Thus, this invention also provides a method of chromosomal 
karyotyping. In this method, modified polynucleotides are 
prepared which correspond to known genes and include mod- 
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fied nucleotides. These polynucleotides are hybridized with 
chromosomal deoxyribonucleic acid and the resulting duplexes 
contacted with appropriate polypeptides under suitable con- 
ditions to permit complex formation. The polypeptides in- 
clude detectable moieties so that the location of the com- 
plexes can be determined and the location of specific genes 
thereby fixed. 

Another embodiment of this invention involves detection of 
poly A-containing sequences using poly U in which some of 
the uracil bases have been modified to contain a probe. Yet 
another emoodiment involves cyclic modified nucleotides in 
which two offx, y, andz are reacted to form the cyclic moiety 

K /> 

p 

O CHJ 

Such cyclic modified nucleotides may then be used to iden- 
tify hormone receptor sites on cell surfaces which in turn 
can be used as a method of detecting cancer or tumor cells. 

Finally, tumor cells can be diagnosed by preparing poly- 
nucleotides which are modified according to this invention 
and are complementary to. the messenger ribonucleic acid 
synthesized from a deoxyribonucleic acid gene sequence 
associated with the production of polypeptides, such 
as o-fetal protein or carcinoembryonic antigen, the pres- 
ence of which is diagnostic for specific tumor cells. Hy- 
bridization and detection of hybrid duplexes thus would 
provide a method for detecting the tumor cells. 

The examples which follow are set forth to illustrate var- 
ious aspects of the present invention but are not intended 
to limit in any way its scope as more particularly set forth 
in the claims. 
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ExamDle 1 and 2 
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Synthe sis of biotinyl - UTP and biotinyl - dUT P 

a) Preparation of Mercurated Nucleotides 

UTP (570 mg, 1.0 mmole) or dUTP 554 mg, 1.0 iranole) was 
dissolved in 100 ml of 0.1 M sodium acetate buffer pH 
6.0, and mercuric acetate (1.59 gm, 5.0 mmoles) added. 
The solution was heated at 50 °C for 4 hours, then cooled 
on ice. Lithium chloride (392 mg, 9.0 mmoles) was added 
and the solution extracted six times with an equal vol- 
ume of ethyl acetate to remo.ve excess HgCl 2 . The effi- 
ciency of the extraction process was monitored by esti- 
mating the mercuric ion concentration in the organic 
layer using 4, 4 '-bis (dimethylamino) -thiobenzophenone 
(A.N. Christoper, Analyst, 94, 392 (1969). The extent 
of nucleotide mercuration, determined spectrophotomet- 
rically following iodination of an aliquot of the aqueous 
solution as described by Dale et al. (R.M.K. Dale, D.C. 
Ward, D.C. Livingston, and E. Martin, Nucleic Acid Res. 
2_, 915 [1975]), was routinely between 90 and 100%. The 
nucleotide products in the aqueous layer, which often be- 
came cloudy during the ethyl acetate extraction, were 
precipitated by the addition of three volumes of ice- 
cold ethanol and collected by centrifugation. The pre- 
cipitate was washed twice with cold absolute ethanol, 
once with ethyl ether, and then air dried. These thus 
prepared mercurated nucleotides were used for the synthe- 
sis of the allylamine-nucleotides without further puri- 
fication. 

b) 

The mercurated nucleotides (of step a) were dissolved in 
0.1 M sodium acetate buffer at P H 5.0, and adjusted to a 
concentration of 20mM (200 OD/ml at 267 nm) . A fresh 2.0 





M solution of allylamine acetate in aqueous acetic acid 
was prepared by slowly adding 1.5 ml of allylamine (13.3 
mmoles) to 8.5 ml of ice-cold 4 M acetic acid. Three ml 
(6.0 mmoles) of the neutralized allylamine stock was added 
to 25 ml (0.5 mmole) of nucleotide solution. One nucleo- 
tide equivalent of K 2 PdCl- 4 , (163 mg, 0.5 mmole), dis- 
solved in 4 ml of water, was then added to initiate the 
reaction. Upon addition of the palladium salt (Alfa- 
Ventron) the solution gradually turned black with metal 
(Hg and Pd) deposits appearing on the walls of the re- 
action vessel. After standing at room temperature for 
18-24 hours, the reaction mixture was passed through a 
0.45 mm membrane filter (nalgene) to remove most of the 
remaining metal precipitate. The yellow filtrate was di- 
luted five-fold and applied to a 100 ml column of DEAE- 
Sephadex TM A-25 (Pharmacia). After washing with one 
column volume of 0.1 M sodium acetate buffer at pH 5.0, 
the products were eluted using a one liter linear gradi- 
ent (0.1-0.6 M) of either sodium acetate at pH ^ 8-9, 
or triethylammonium bicarbonate (TEAB) at pH 7.5. The 
desired product was in the major UV-absorbing portion 
which eluted between 0.30 and 0.35 M salt. Spectral ana- 
lysis showed that this peak contained several products, 
final purification was achieved by reverse phase - HPLC 
chromatography on columns of Partisil - ODS2, using either 
0.5H NH 4 h 2 P0 4 buffer at pH 3.3 (analytical separations), 
or 0.5 M triethylammonium acetate at pH 4.3 (preparative J J 

separations) as eluents. The 5 ' -triphosphates of 5- 
C5 -aminopropen-l-yl) uridine (the allylamine adduct to l^ir^' 
uridine) were the last portions to be eluted. from the HPLC^Vl ^ 
column and they were clearly resolved from three, as yet uncha- 
racterized, contaminants. These nucleotides were charac- 
terized by proton NMR elemental analysis t AA-dUTP (C 



H 16 N 3 °14 P 3 Na 4* 1 H 2 0) : theor Y C, 22.91; H, 2.88; N, 
6.68; F, 14.77. Found, C, 23.10; H, 2.85; N , 6.49; P, 14.75. 
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AA-UTP (C 12 H lg N 3 0 15 P 3 Na 4 . 4H 2 0) : Theory, C 20.61; 
H, 3.46; N, 6.01; P, 13.3. Found C, 20.67; H, 4.11; 
N, 5.39; P, 13.54] spectrally and chromatographically . 

c) Biotination of AA-dUTP or AA-UTP 

Biotinyl-N-hydroxysuccinimide ester (NHSB) was prepared 
from biotin (Sigma) as described previously (H. Heitzmann 
and P.M. Richards, Proc. Natl. Acad. Sci. USA. 71, 3537 
[1974]). AA-dUTP*H 2 0 (63 mg, 0.1 mmole) or AA-UTP-,4H 2 0 
(70 mg, 0.1 mmole) was dissolved in 20 ml of 0.1 M sodium 
borate buffer at pH 8.5, and NHSB (34.1 mg, 0.1 mmole) 
dissolved in 2 ml of dimethyl formamide, was added. The 
reaction mixture was left at room temperature for four 
hours and then loaded directly onto a 30 ml column of 
DEAE-Sephadex TM A-25, preequilibrated with 0.1 M TEAB 
at pH 7.5. The column was eluted with a 4 00 ml linear 
15 gradient (0.1-0.9 M) of TEAB • Fractions containing 

biotinyl-dUTP or biotinyl-UTP , which eluted between 0.55 
and 0.65 M TEAB, were desalted by rotary evaporation in 
the presense of methanol and redissolved in water. Oc- 
caionally a slightly cloudy solution was obtained: this 
2 0 turbidity, due to a contaminant in some TEAB solutions, 
was removed by filtration through a 0.45 mm filter. For 
long term storage, the nucleotides were converted to the 
sodium salt by briefly stirring the solution in the pre- 
sence of Dowex TM 50 , (Na + form). After filtration the 
2 5 nucleotide was precipitated by the addition of three vol- 
umes of cold ethanol, washed with ethyl ether, dried in 
vacuo over sodium hydroxide pellets, and stored in a des- 
sicator at -20 °C. For immediate use, the nucleotide so- 
lution was made 20 mM in Tris-HCl at pH 7.5, and adjusted 
30 to a final nucleotide concentration of 5 mM. Stock solu- 
tions were stored frozen at -20 °C. 

Elemental analysis of the bio-dUTP and bio-UTP products 
yielded the following results. Bio-dUTP (C 2 2 H30 N5 
018 ?3 Si Na4- 1 H2O) . Theoretical; C, 29.80; H, 3.38; 




N, 7.89; P, 10.47; S. 3.61. Found; C, 30.14 H, 3.22; N, 
7.63; P, 10.31; S, 3.70. Bic-UTP (C 22 H 30 N 5 O19 P3 Si 
Na4 -3 H 2 0) ; Theoretical; C, 29.15; H, 3.19; N, 7.45; 
P, 9.89; S, 3.41. Found; C, 28.76; H, 3.35; N, 7.68; 
5 P, 9.81; S, 3.32. 

The spectral properties of bio-dUTP and bio-UTP at pH 7.5 
[ * max, 289 nm ( e = 7,100); A max, 240 nm ( £ =10,700); 

X min, 262 nm ( e=4,300)] reflect the presence of an 
exocylic double-bond in conjugation with the pyrimidine 

10 ring. These nucleotides also give a strong positive re- 
action (an orange-red color) 'when treated with p-dimethyl- 
aminocinnamaldehyde in ethanolic sulfuric acid, a proce- 
dure used for biotin quantitation (D.B. McCormick and J. A. 
Roth, Anal. Biochem. , 3£, 326, 1970). However, they no 

15 longer react with ninhydrin, a characteristic reaction 
of the AA-dUTP and AA-UTP starting materials. 

Examples 3 and 4 

20 Synthesis of biotinyl-CTP and biotinyl-dCTP 

CTP and dCTP were a) mercurated, b) reacted with allyla- 
mine, and c) biotinized with NHS-biotin, essentially as 
described in Example 1. CTP (56.3 mg, 0.1 mmole) or dCTP 
(59.1 mg, 0.1 mmole) were dissolved in 20 ml of 0.1 M 

25 sodium acetate buffer at pH 5.0, and mercuric acetate 
(0.159 gm, 0.5 mmoles) added. The solution was heated 
at 50°C for 4.5 hours then cooled on ice. Lithium 
chloride (39.2 mg, 0.9 mmoles) was added and the solution 
extracted 6 times with ethyl acetate. The nucleotide 

30 products in the aqueous layer were precipitated by the 
addition of three volumes of cold ethanol and the pre- 
cipitate collected by centrif ugation . The precipitate 
was washed with absolute ethanol, ethyl ether, and then 
air dried. These products were used without further 

35 purification for the synthesis of AA-CTP and AA-dCTP, 





respectively. The mercurated nucleotides were dissolved 
in 0.1 M sodium acetate buffer at pH 5.0 and adjusted 
to a concentration of 10 mM (92 OD/ml at 275 nm) . 0.6 
ml (1.2 mmole) of a 2.0 M allylamine acetate stock (pre- 
5 pared as described in Example 1) was added to 10 ml of 

nucleotide solution (0.1 mmole) followed by the addition 
of K^PdC^ (32.6 mg, 0.1 mmole), dissolved in 1.0 ml of 
H2O. After standing at room temperature for 24 hours, 
the solution was filtered through a 0.45 mM membrane to 

10 remove metal precipates. The filtrate was diluted five- 
fold and loaded onto a 50 ml- column of DEAE-sephadex A-25, 
preequilibrated with 50 mM TEAB at pH 7.5. The nucleotide 
products were fractionated by application of a 500 ml lin- 
ear gradient (0.05-0.6 M) of TEAB at pH 7.5. The desired 

15 product was in the major UV absorbing portion which eluted 
between 0.28 and 0.38 M salt. The pooled samples were de- 
salted by rotary evaporation, dissolved in 0.5 M triethyl- 
, ammonium acetate at pH 4.2, and final purification achieved 
by HPLC chromatography on columns of Partisil ODS-2 , using 

2 0 0.5 M triethylammonium acetate as the eluent. Appropriate 
fractions were pooled, lyophilized, and the products dis- 
solved in H2O. The nucleotides were converted to the Na + 
salt by stirring briefly in the presence of Dowex TM 50 
(Na+ form). After filtration, to remove the Dowex resin, 

25 the nucleotides were precipitated by the addition of 3 

volumes of cold ethanol. The precipitate was washed with 
ether and then air dried. Analytical results: AA-dCTP 
(C 12 H 17 N 4 °13 p 3 Na 4 • 2H2O) ; Theory, C, 22.29; H, 
2.63; N, 8.67, P, 14.40. Found C, 22.16; H. 2.89; N. 

30 8.77; P, 14.18. AA-CTP (C 12 H 17 N4 0 14 Na 4 . 2H 2 0) ; 

Theory C, 21.75; H, 2.57; N, 8.46; P, 14.01. Found, C, 
22.03; H, 2.47; N, 8.69; P, 13.81; Spectral properties 
in 0.1 M Borate buffer at pH 8.0, A max 301 nm ( e =6,400), 
\ min 271 nm ( e =3,950) \ max 250 nn ( e=9,700). Both 

35 AA-dCTP and AA-CTP give a positive ninhydrin test. . 
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AA-CTP (6.6 nig, 0.01 mmole) or AA-dCTP (6.4 mg, 0.01 mmole) 
was dissolved in 5 ml of 0.1 M sodium borate buffer at pH 
8.5, and NHS-biotin (3.4 mg, 0.01 mmole), dissolved in 0.2 
. ml of dimethyl formamide, was added. After sitting at 
5 room temperature for 4 hours the sample was chromatographed 
on a 10 ml column of DEAE-Sephadex A-25, using a 150 ml 
linear gradient (0.1-0.9 M) of TEAB at pH 7.5, as eluent. 
Fractions containing biotinyl-CTP or biotinyl-dCTP , which 
eluted between 0.50 and 0.60 M TEAB, were pooled, desalted 

10 by rotary evaporation, and after being adjusted to a final 
concentration of 5 mM in 0.02 M Tris-HCl buffer at pH 7.5, 
were frozen at -20 °C. The products give a strong positive 
reaction for biotin with p-dimethylaminocinnamldehyde in 
ethanolic sulfuric acid but give a negative test for pri- 

15 mary amines when sprayed with ninhydrin. Further struc- 
tural characterization of these products is in progress. 

Examples 5 and 6 

2 0 Synthesis of Iminobiotinyl-UTP and Iminobiotinyl-dUTP 

Iminobiotin hydrobromide was prepared from biotin as de- 
scribed previously (K. Ilofmann, D.B. Melville and V. du 
Vigneaud, J. Biol. Ghent, 141, 207-211, 1941; K. Hofmann 
and A.E. Axelrod, Ibid., 187 , 29-33, 1950). The N-hy- 

25 droxysuccinimide (NHS) ester of iminobiotin was prepared 
using the protocol previously described for the synthesis 
of NHS-Biotin (H. Heitzmann and F.M. Richards, Proc. Nat. 

Acad. Sci. USA, 71, 5537, 1974). AA-UTP (7.0 mg, 0.01 
mmole) or AA-dUTP (6.3 mg, 0.01 mmole), prepared as de- 

30 tailed in example 1 (part b) , was dissolved in 5 ml of 

0.1 M sodium borate buffer at pH 8.5, and NHS-iminobiotin 
(3.5 mg, 0.01 mmole), dissolved in 0.5 ml of dimethyl- 
formamide, was added. The reaction mixture was left at 
room temperature for 12 hours and then loaded directly 

35 onto a 10 ml column of DEAE-Sephadex A-25, preequilibrated 




' # ' ; ' ' ' • 

- 47 - 

i ■ 

with 0.05 M TEAB at pH 7.5. The column was eluted with 
a 150 ml linear gradient (0.05-0.6 M) of TEAB. Fractions 
containing iminobiotin-UTP or iminobiotin-dUTP , which 
eluted between 0.35 and 0.4 0 M TEAB , were desalted by ro- 
5 tary evaporation in the presence of methanol and dissolved 
in H2O. The products contained a small amount of allyla- 
mine-nucleotide adduct as an impurity, as judged by a 
weak positive result in the ninhydrin test. Final puri- 
fication was achieved by affinity chromatography on avidin- 

10 sepharose. Fractions of the impure product, made 0 . 1 M in 
soldium borate buffer at pH 8 . 5 , were applied to a 5 ml 
column of avidin-sepharose and washed with 2 5 ml of the 
same buffer. The column was then washed with 50 mM ammonium 
acetate buffer at pH 4.0, which eluted the desired imino- 

15 biotin-nucleotide product in a sharp peak. The nucleotide 
was precipitated by the addition of 3 volumes of cold 
ethanol, washed with ethylether, dried in vacuo over so- 
dium hydroxide pellets and stored in a dessicator at -20 °C. 
Products were characterized by elemental analysis, as well 

20 as by spectral and chromotographic properties. 

Examples 7 and 8 

Synthesis of NAGE-UTP and NAGE-dUTP 

25 Allyl (3-amino-2-hydroxy-)'propyl ether, abbreviated NAGE , 
was prepared from allyl glycidyl ether (Age) (obtained 
from Aldrich Chemical Co.). 10 ml of Age (84 mmole) was 
added slowly (in a fume hood) to 50 ml of 9 M ammonium 
hydroxide and the mixture allowed to stand at room tempera- 

30 ture for six hours. Excess ammonia was removed by rotary 
evaporation under reduced pressure to yield a viscous yel- 
low oil. Analysis of this product by proton NMR showed 
that it possessed the required structure. 5-mercuri-dUTP 
(0.1 mmole) or 5-mercuri-UTP (0.2 mmole) was dissolved in 

35 2-4 ml of 0.2 M sodium acetate buffer at pH 5.0, and a 16 
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fold molar excess of NAGE adjusted to pH 5.0 with 
acetic acid prior to use, was added. The final reaction 
volumes (4.3 and 8.4 ml) had nucleotide concentrations 
of 43 and 42 mM, respectively. One equivalent of K2PdCl4 
(0.1 or 0.2 mmoles) was added to initiate the reaction. 
After standing at room temperature for 18 hours, the re- 
action mixtures were filtered through 0.45; jjiUM membranes 
the samples diluted five-fold, and chromatographed on 
columns of DEAE-Sephadex A-25, using linear gradients 
(0.1-0.6 M) of sodium acetate. Fractions containing ttfe 
desired products, as judged by their UV spectra and cha- 
racteristic HPLC elution profiles on Partisil ODS-2, were 
pooled, diluted, and further purified by rechromatography 
on DEAE-Sephadex using shallow gradients (0.1-0.5 M) of 
15 ammonium bicarbonate at pH 8.5. Under these conditions 

the majority of the, NAGE-dUTP (or NAGE-UTP) could be cleanly 
separated from residual impurities. Proton NMR spectra 
were obtained at this stage of purification after the 
nucleotides were lyophilized and redissolved in D2O. For 
elemental analysis, the products were converted to their 
sodium salt form. Typical analytical results: Nage-dUTP 
< c 15 H 22 N 3 0 16 , p 3 Na 4 • 2 H 2 0) , Theory, C, 24.99; H, 3.63; 
N, 5.83; P,. 12.88. Found, C, 25.39; H, 3.71; N, 5.63; P, 



12.88 



E xample 9 

Uses of Labeled DNA Sequences 
I , Karyotyping 

(a) Select from a human gene library some 100 to 200 clones. 
Label them as described above, and for each clone locate 
its place or places of hybridization visually or with a 
low-light-level video system. For those clones which 
correspond to a unique sequence gene this determines the 
location of the cloned DNA on a particular human chromo- 
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some. Obtain several clones for each chromosome. Each 
of these labeled clones can be used to identify particu- 
lar chromosomes. They can also be used in combination 
to identify each of the 4 6 chromosomes as being one of 
the 22 autosomal pairs or the X or the ;Y. By allowing 
one set of labeled clones to hybridize to the chromo- 
somes and then adding a fluorescent stain to the label, 
the set of clones and their locations can be visualized 
and will flouresce with a particular color. A second set 
of labeled clones could then be used and reacted with a 
second fluorescent dye. The same process can be repeated 
a number of times. Thus one can, if desired, have sev- 
eral sets of fluorescent labels attached to the cellular 
DNA at different but specific locations on each of the 
chromosomes. These labels could be used for visual or 
computerized automatic karyotyping . 

(b) For automatic karyotyping, one could use one set of 
clones to identify the approximate location of each of 
the 4 6 chromosomes by finding sets of spots corresponding 
to the number of labeling sites on each chromosome. Thus, 
it is possible by computer analysis of the digitized images 
to determine if the chromosomes are suitably spread for 
further analysis. If they are suitably spread/ then one 
can use computer analysis to identify each of the individ- 
ual chromosomes by the location and distribution of the 
labelled spots on each one. 

By using the fact that the fluorescent spots can be placed 
at specific locations on each chromosome/ one can carry out 
either manual or automatic karyotyping very much more 
effectively than without such labels. 

II- Diagnosis of Genetic Disorders 



By selecting the clones which bind specifically to a 
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particular chromosome/ such as number 23, it is possible 
to count the number of copies of the particular chromo- 
some in a cell even if the chromosomes are not condensed 
at metaphase. Thus when fetal cells are obtained for 
prenatal, diagnosis of trisomy 21, the diagnosis can be 
done even if the chromosomes are not condensed at meta- 
phase. If necessary, two sets of labels can be used — 
one which would be specific for chromosome 2 3 and one for 
some other chromosome. By measuring in each cell the 
ratio of the two labels, which might be of different 
colors, it is possible to identify the cells which show 
an abnormal number of chromosomes number 23. This pro- 
cedure could be used either on slides with a low-light- 
level video system or in a flow cytometer system using 
laser excitation. It can be used to determine any ab- 
normal chromosome number. 

Microogranism Detection and Identification 

The labeling of specific sequences of DNA as described 
above permits identification and counting of individual 
bacteria. In order to identify the individual bacteria 
to which a particular fragment of DNA hybridizes the sensi- 
tivity must be such that a single labelled structure can 
be detected. This can be done using a low-light-level 
video system and computer summation of images, or by 
using some other device for intensifying the light image. 
A flow system can also be used if the sensitivity can be 
made sufficiently grand. If one immobilized the bacteria 
on a slide their location could be found and the number 
of such fluorescent spots counted. This would provide a 
count of all of those bacteria which contain DNA which 
can hybridize whith the specific clone utilized. If the 
clone is selected as being specific for a particular strain 
or bacteria, then one can count the number of organisms of 
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that strain. In addition, any antibiotic resistance for 
which a particular gene has been identified could be 
characterized in a similar way using, as a probe, the 
DNA sequence which is contained in the antibiotic re- 
sistance gene. In addition, a probe could be used which 
is specific for a resistance plasmid containing one or more 
antibiotic resistance genes. In addition to individual 
bacteria, groups of bacterial cells of a particular 
strain can be detected and their number estimated if 
they are located in a small spot so that the total flu- 
orescence specific to the hybridized DNA in the spot can 
be measured. In this way the number of organisms con- 
taining a specific DNA sequence can be measured in a mixture 
of bacteria. 
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